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Pm ■ Abstract 

Oh 
^ ■ We prove that the linearly edge reinforced random walk (LRRW) on any graph 

with bounded degrees is recurrent for sufficiently small initial weights. In con- 
trast, we show that for non-amenable graphs the LRRW is transient for sufficiently 
large initial weights, thereby establishing a phase transition for the LRRW on non- 
amenable graphs. While we rely on the description of the LRRW as a mixture of 
Markov chains, the proof does not use the magic formula. We also derive analogous 
^ ' results for the vertex reinforced jump process. 

Tt ! 1 Introduction 

en 

O ' The linearly edge reinforced random walk (LRRW) is a model of a self -interacting (and 

^ ' hence non-Markovian) random walk, proposed by Coppersmith and Diaconis, and de- 

fined as follows. Each edge e of a graph G = {V,E) has an initial weight a^ > 0. A 
starting vertex vq is given. The walker starts at ^o- It examines the weights on the edges 
rS ■ around it, normalizes them to be probabilities, and then chooses an edge with these 

C^ , probabilities. The weight of the edge traversed is then increased by 1 (the edge is "rein- 

forced")- The process then repeats with the new weights. 

The process is called linearly reinforced because the reinforcement is linear in the 
number of steps the edge was crossed. Of course one can imagine many other reinforce- 
ment schemes, and those have been studied (see e.g. [17] for a survey). Linear reinforce- 
ment is special because the resulting process is partially exchangeable. This means that if 
a and j6 are two finite paths such that every edge is crossed exactly the same number of 
times by oc and /3, then they have the same probability (to be the beginning of an LRRW). 
Only linear reinforcement has this property. 
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Diaconis & Freedman [2, Theorem 7] showed that a recurrent partially exchangeable 
process is a mixture of Markov chains. Today the name random walk in random environment 
is more popular than mixture of Markov chains, but they mean the same thing: that there 
is some measure f/ on the space of Markov chains (known as the "mixing measure") such 
that the process first picks a Markov chain using f/ and then walks according to this 
Markov chain. In particular, this result applies to the LRRW whenever it is recurrent. 
There "recurrent" means that it returns to its starting vertex infinitely often. We find this 
result, despite its simple proof (it follows from de Finetti's theorem for exchangeable 
processes, itself not a very difficult theorem) to be quite deep. Even for LRRW on a graph 
with three vertices it gives non-trivial information. For general exchangeable processes 
recurrence is necessary; see [2, Example 19c] for an example of a partially exchangeable 
process w^hich is not a mixture of Markov chains. For LRRW this cannot happen, it is a 
mixture of Markov chains even when it is not recurrent (see Theorem 4 below). 

On finite graphs, the mixing measure ji has an explicit expression, known fondly as 
the "magic formula". See [11] for a survey of the formula and the history of its discovery. 
During the last decade significant effort was invested to understand the magic formula, 
with the main target the recurrence of the process in two dimensions, a conjecture dating 
back to the 80s (see e.g. [16, §6]). Notably, Merkl and Rolles [14] showed, for any fixed a, 
that LRRW on certain "dilute" two dimensional graphs is recurrent, though the amount 
of dilution needed increases with a. Their approach did not work for Z^, but required 
stretching each edge of the lattice to a path of length 130 (or more). The proof uses 
the explicit form of the mixing measure, which turns out to be amenable to entropy 
arguments. These methods involve relative entropy arguments which also lead to the 
Mermin- Wagner theorem [10]. This connection suggests that the methods should not 
apply in higher dimension. 

An interesting variation on this theme is when each directed edge has a weight. When 
one crosses an edge one increases only the weight in the direction one has crossed. This 
process is also partially exchangeable, and is also described by a random walk in a ran- 
dom environment. On the plus side, the environment is now i.i.d., unlike the magic 
formula which introduces dependencies between the weights of different edges at all 
distances. On the minus side, the Markov chain is not reversible, while the Markov 
chains that come out of the undirected case are reversible. These models are quite differ- 
ent. One of the key (expected) features of LRRW — the existence of a phase transition on 
Z'* for d > 3 from recurrence to transience as a varies — is absent in the directed model 
[18, 8]. In this paper we deal only with the reversible one. 

Around 2007 it was noted that the magic formula is remarkably similar to the for- 
mulae that appear in supersymmetric hyperbolic cr models which appear in the study of 
quantum disordered systems, see Disertori, Spencer and Zirnbauer [6] and further see 
Efetov's book [7] for an account of the utility of supersymmetry in disordered systems. 
Very recently. Sabot and Tarres [19] managed to make this connection explicit. Since 



recurrence at high temperature was known for the hyperbolic a model [5], this led to a 
proof that LRRW is recurrent in any dimension, when a is sufficiently small (high tem- 
perature in the a model corresponds to small a). We will return to Sabot and Tarres's 
work later in the introduction and describe it in more details. However, our approach is 
very different and does not rely on the magic formula in any way. 
Our first result is a general recurrence result: 

Theorem 1 For any K there exists some uq > such that if G is a graph with all degrees 
hounded by K, then the linearly edge reinforced random walk on G with initial weights a G 
(0, flo) is a.s. positive recurrent. 

Positive recurrence here means that the walk asymptotically spends a positive frac- 
tion of time at any given vertex, and has a stationary distribution. In fact, the LRRW 
is equivalent to a random walk in a certain reversible, dependent random environment 
(RWRE) as discussed below. We show that this random environment is a.s. positively 
recurrent. We formulate and prove this theorem for constant initial weights. However, 
our proof also works if the initial weights are unequal as long as for each edge e, the 
initial weight flg is at most Aq- With minor modifications, our argument can be adapted 
to the case of random independent a/s with sufficient control of their moments. See the 
discussion after the proof of Lemma 8 for details. 

Let us stress again that we do not use the explicit form of the magic formula in the 
proof. We do use that the process has a reversible RWRE description, but we do not 
need any details of the measure. The main results in [19] are formulated for the graphs 
Z** but Remark 5 of that paper explains how to extend their proof to all bounded degree 
graphs. Further, even though [19] claims only recurrence, positive recurrence follows 
from their methods. Thus the main innovation here is the proof technique. 

It goes back to [2] that in the recurrent case the weights are uniquely defined af- 
ter normalizing, say by Wi,(, = 1 (see there also an example of a transient partially ex- 
changeable process where the weights are not uniquely defined). Hence with Theorem 1 
the weights are well defined and we may investigate them. Our next result is that the 
weights decay exponentially in the graph distance from the starting point. Denote graph 
distance by dist( •, ■ ) . Also, let dist(e, Vq) denote the minimal number of edges in any path 
from Vq to either endpoint of e. 

Theorem 2 Let G he a graph with all degrees hounded hy K and let s S (0,1/4). Then there 
exists uq = ao{s,K) > such that for all a G (0,ao), ifei is the first edge crossed hy X, 

/ W \ ^ / \ dist(e,Oo) 

B{W,Y<1E(^-^J <2k(c(s,K)v/^) . (1) 

Note that aq does not depend on the graph except through the maximal degree. The 
factor 2K on the right-hand side is only relevant, of course, when dist(e, Vq) =0, other- 



wise it can be incorporated into the constant inside the power. 

The parameter s deserves some explanation. It is interesting to note that despite 
the exponential spatial decay, each W, does not have arbitrary moments. Examine, for 
example, the graph with three vertices and two edges. The case when the initial edge 
weights are 2 and the process starts from the center vertex is particularly famous as it is 
equivalent to the standard Polya urn. In this case the weights are distributed uniformly 
on [0, 1]. This means that the ratio of the weights of the two edges does not even have 
a first moment. Of course, this is not quite the quantity we are interested in, as we are 
interested in the ratio W^/Wf where Wj is the first edge crossed. This, though, is the 
same as an LRRW with initial weights a, a + \ and starting from the side. Applying the 
magic formula can show that the ratio has 1 + fl/2 moments, but not more. It is also 
known directly that in this generalized Polya urn the weights have a Beta distribution 
(we will not do these calculations here, but they are straightforward). Hence care is 
needed with the moments of these ratios. 

Our methods can be easily improved to give a similar bound for s < ^, with ^/a 
replaced by a suitably smaller power. See the discussion after Lemma 8 for details. Our 
proof can probably be modified to give 1/2 a moment, and depending on a, a bit more. 
Going beyond that seems to require a new idea. 

The most interesting part of the proof of Theorems 1 and 2 is to show Theorem 2 
given that the process is already known to be recurrent, and we will present this proof 
first in § 2. Theorem 1 then follows easily by approximating the graph with finite sub- 
graphs where the LRRW is of course recurrent. This is done in § 3. 

1.1 Transience 

An exciting aspect of LRRW is that on some graphs it undergoes a phase transition in 
the parameter a. LRRW on trees was analyzed by Pemantle [16]. He showed that there is 
a critical flc such that for initial weights a < Uc, the process is positively recurrent, while 
for a > Uc it is transient. We are not aware of another example where a phase transition 
was proved, nor even of another case where the process was shown to be transient for 
any initial weights. The proof of Pemantle relies critically on the tree structure, as in that 
case, when you know^ that you have exited a vertex through a certain edge, you know^ 
that you will return through the very same edge, if you return at all. This decomposes 
the process into a series of i.i.d. Polya urns, one for each vertex. Clearly this kind of 
analysis can only work on a tree. 

The next result will show the existence of a transient regime in the case of non- 
amenable graphs. Recall that a graph G is non-amenable if for some i > and any 
finite set A C G, 

\dA\ > l\A\ 

where dA is the external vertex boundary of A i.e. dA = {x : dist(x. A) = 1}. The largest 



constant l for which this holds is called the Cheeger constant of the graph G. 

Theorem 3 For any K,cq> Q there exists uq so that the following holds. Let Gbea graph with 
Cheeger constant l > cq and degree bound K. Then for a > ao the LRRW on G with all initial 
weights a on all edges is transient. 

Theorem 3 will be proved in § 4. As with Theorem 1, our proof works w^ith non-equal 
initial weights, provided that a^ > Aq for all e. It is tempting to push for stronger results 
by considering graphs G with intermediate expansion properties such that the simple 
random walk on G is transient. To put this in perspective, let us give some examples 
where LRRW has no transient regime. 

1. The canopy graph is the graph Zi+ with a finite binary tree of depth n attached at 
vertex n. It is often poetically described as "an infinite binary tree seen from the leaves". 
Since the process must leave each finite tree eventually, the process on the "backbone" 
is identical to an LRRW on Z+, which is recurrent for all a (say from [1] or from the 
techniques of [16]). 

2. Let T be the infinite binary tree. Replace each edge on the n* level by a path of 
length M^. The random walk on the resulting graph is transient (by a simple resistance 
calculation, see e.g. [4]). Nevertheless, LRRW is recurrent for any value of a. This is 
because LRRW on Z+ has the expected weights decaying exponentially (again from the 
techniques of [16]) and this decay wins the fight with the exponential growth of the 
levels. 

3. These two example can be combined (a stretched binary tree with finite decorations) 
to give a transient graph with exponential growth on w^hich LRRW is recurrent for any 

a. 

We will not give more details on any of these examples as that will take us off -topic, but 
they are all more-or-less easy to do. The proof of Theorem 3 again uses that the process 
has a dual representation as both an self-interacting random walk and as an RWRE. This 
might be a good point to reiterate that the LRRW is a mixture of Markov chains even 
in the transient case (the counterexample of Diaconis and Freedman is for a partially 
exchangeable process, but that process is not a LRRW). This was proved by Merkl and 
Rolles [13], who developed a tightness result for this purposes which will also be useful 
for us. Let us state a version of their result in the form we will need here along with the 
original result of Diaconis and Freedman; 

Theorem 4 ([2, 13]) Fix a finite graph G = {V,E) and initial weights a = (fle)eeE '^*^J^th 
ae > 0. Then X is a mixture of Markov chains in the following sense: There exists a unique 
probability measure jionClso that 



p= /pW4i(W) 



is an identity of measures on infinite paths in G. All W(e) > 0/or all e. 

Moreover, the W^'s form a tight family of random variables: if we set Wj, = Y,e3v^e and 
fli, = Le3D ^e' ihere are contestants Ci, 02 depending only on a^,, Ug so that 

^ {We/W^ <£) < cie"'^^ and }i {]Ne/y^v > 1 - e) < C2e(''"""'-'^^ 

As noted, we mainly need the existence of such a representation of the LRRW, as well 
as the tightness of the W's, but not the explicit bounds. 

It is natural to conjecture that in fact a phase transition exists on "ZJ^ for all d > 3, 
analogously to the phase transition for the supersymmetric a model. What happens 
in d = 2? We believe it is recurrent for all a, but dare not guess w^hether it enjoys a 
Kosterlitz-Thouless transition or not. 

1.2 Back to Sabot and Tarres 

The main object of study for Sabot and Tarres [19] are vertex reinforced jump processes 
(VRJP). Unlike LRRW, this is defined in continuous time process with reinforcement 
acting through local times of vertices. One begins with a positive function J = {Je)eGE on 
the edges of G. These are the initial rates for the VRJP process Yf, and are analogous to 
the initial weights a of LRRW. Let {Lx{t))xGV be the local times for Y at time t and vertex 
X. If Yf = X then Y jumps to a neighbor y with rate /i-t/(l + Ly{t)). See [19] for history 
and additional reference for this process. 

The VRJP shares a key property with the LRRW: a certain form of partial exchange- 
ability after applying a certain time change. This suggests that it too has a RWRE de- 
scription (Diaconis and Freedman [2] only consider discrete time processes, but their 
ideas should carry over). Such a RWRE description exists was found by Sabot and Tarres 
by other methods. The existence (though not the formula) for such a form is fundamen- 
tal for our proof. Their main results are the following: First, the law of LRRW with initial 
weight a is identical to the time-discretization of Yf when J is i.i.d. with marginal distri- 
bution r(a, 1). Secondly, after a time change, Yf is identical to a mixture of continuous- 
time reversible Markov chains. Moreover, the mixing measure is exactly the marginal of 
the supersymmetric hyperbolic cr models studied in [5, 6]. This is analogous to the magic 
formula for the LRRW. Finally, as already mentioned, this allowed them to harness the 
techniques of [5, 6] to prove that LRRW is recurrent for small a in all dimensions. 

Since the VRJP has both a dynamic and an RWRE representation, our methods apply 
to this model too. Thus we show: 

Theorem 5 Let G be a fixed graph with degree bound K. Let J = {Je)eeE be a family of 
independent initial rates with 

E/i/^ < c{K), 

where c{K) is a constant depending only on K. Then (a.s. with respect to J), Yt is recurrent. 



In particular this holds for any fixed, sufficiently small J. We formulate this result 
for random J because of the relation to LRRW explained above — we could have also 
proved the LRRW results for random a but we do not see a clear motivation to do so. 
We will prove this in § 5, where we will also give more precise information about the 
dependency of c on K. 

Next we wish to present a result on exponential decay of the w^eights, an analogue of 
Theorem 2. To make the statement more meaningful, let us describe the RWRE (which, 
we remind, is not Yf but a time change of it). We are given a random function W on the 
vertices of our graph G. The process is then a standard continuous- time random walk 
which moves from vertex i to vertex j with rate ^JijWj/Wi. The result is now: 

Theorem 6 If the Je are independent, with E/^^ < c{K) then for a.e. J, the infinite volume 
mixing measure exists and under the joint measure, for any vertex v E G, 



EWy^ < 2K"'*'^'®'("'',»)_ 



In particular, this implies that the time-discretization of Y is positively recurrent, and 
not just recurrent. For any s < 1/5, bounds on lEp can yield the an estimate on E Wfj and 
Theorem 5. 

It is interesting to note that the proof of Theorems 5 and 6 are simpler than that of 
Theorems 1 and 2, even though the techniques were developed to tackle LRRW. The 
reader will note that for VRJP, each edge can be handled without interference from ad- 
joining edges on the same vertex, halving the length of the proof. Is there some inherent 
reason? Is VRJP (or the supersymmetric a model) more basic in some sense? We are not 
sure. 

1,3 Notations 

In this paper, G will always denote a graph with bounded degrees, and K a bound on 
these degrees. The set of edges of G will be denoted by E. flg w^ill always denote the 
initial weights, and when the notation a is used, it is implicitly assumed that a^ = a for 
all edges e G £. 

Let us define the LRRW again, this time using the notations we will use for the 
proofs. Suppose we have constructed the first k steps of the walk, xq,. . . ,Xf^. For each 
edge e G £, let 

me) = \{j<k:e={Xj,Xj+^)}\ 

be the number of times that the undirected edge e has been traversed up to time k. Then 
each edge e incident on X;^ is used for the next step with probability proportional to 
a + Nk{e), that is, li X^ — v then 



where dj, is the degree of v; where Nk{v) denotes the sum of Nk{e) over edges incident to 
v; and where ~ is the neighborhood relation i.e. v ^ iv <^==> {v, w) G E. It is crucial to 
remember that N/^ counts traversals of the undirected edges. We stress this now because 
at some points in the proof we will count oriented traversals of certain edges. 

While all graphs we use are undirected, it is sometimes convenient to think of edges 
as being directed. Each edge e has a head e^ and tail e+. The reverse of the edge is 
denoted e^^. ApathoflengthninGmay thenbe defined as a sequence of edges ei,. . .,e„ 
such that e^ = er -^ for all i. Vice versa, if v and w are two vertices, {v, w) will denote the 
edge whose two vertices are v and w. 

By W we will denote a function from E to [0, oo) ("the weights"). We will denote We 
instead of W(e) and for a vertex v we will denote 

W„ := ^ We. 

eBv 

The space of all such W will be denoted by O and measures on it typically by }i. The 
}i which describes our process (whether unique or not) will be called "the mixing mea- 
sure". 

Given W we define a Markov process with law P^ on the vertices of G as follows. 
The probability to transition from v to iv, denoted P^(y, w) is W^j, j„^ / W^. For a given }i 
on O, the RWRE corresponding to /^ is a process on the vertices of G given by 

W{Xo = VQ,...,Xn = Vn)= /nP^(X,_i,X,)dfi(W). 

This process will always be denoted by X. 

Definition A process is recurrent if it returns to every vertex infinitely many times. 

This is the most convenient definition of recurrence for us. It is formally different 
from the definition of [2] we quoted earlier, but by the results of [13] quoted above they 
are in fact equivalent for LRRW. 

We also use the following standard notation; for two vertices v and w we define 
dist(y, lo) as the length of the shortest path connecting them (or if u = id). For an edge 
e we define dist(y,e) = min{dist(y,e^),dist(t7,e+)}. For a set of vertices A we define 
dA = {x : dist(x. A) = 1} i.e. the external vertex boundary. ByX = Y we denote that the 
variables X and Y have the same distribution. Bern(/3) will denote a Bernoulli variable 
with probability p to take the value 1 and 1 — p to take the value and Exp (A) will 
denote an exponential random variable with expectation 1/A. We will denote constants 
whose precise value is not so important by c and C, where c will be used for constants 
sufficiently small, and C for constants sufficiently large. The value of c and C (even if 
they are not absolute but depend on parameters, such as C{K)) might change from one 



place to another. By x x y we denote ex < y < Cx. 
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2 Proof of Theorem 2 

In this section we give the most interesting part of the proof of Theorem 2, show^ing 
exponential decay assuming a priori that the proeess is reeurrent. We give an upper bound 
^vhich only depends on the maximal degree K. In the next section use apply this result 
to a sequence of finite approximations to G to prove recurrence for the whole graph and 
complete the proof. 

Before we begin, we need to introduce a few notations. For any edge e = (e^,e+) 
that is traversed by the walk, let e' be the edge through which e^ is first reached. In 
particular, e' is traversed before e. If e is traversed before its inverse e^^, then e' is distinct 
from e as an undirected edge. Iterating this construction to get e", e'", etc. yields a path 
7 = {. . . , e", e', e} starting with the first edge used from vq, and terminating with e. We 
call 7 the path of domination of e. This path is either a simple path, or a simple loop 
in the case that e+ is the starting vertex Vq. In the former case 7 is the backwards loop 
erasure of the LRRW. All edges in the path are traversed before their corresponding 
inverses. Let ^y be the event that the deterministic path 7 is the path of domination 
corresponding the final edge of 7. 

For an edge e with e' 7^ e^^, let Q{e) be an estimate for W^/Wg/ defined as follows: If 
e is crossed before / := (e')^^ then set Me to be the number of times e is crossed before 
/, and set Mf = 1. If / is crossed before e then set Mf to be the number of times / is 
crossed before e, and set Me = 1. In both cases we count crossing as directed edges. In 
other words, we only count crossings that start from e^ , the common vertex of e and / 
(in which case the walker chooses between them). Then 

Qie) := ^ 
^^ ' Mf 

is our estimate for ]Ne/^e'- Thus to find Q(e) we wait until the LRRW has left e^ along 
both e and /, and take the ratio of the number of departures along e and along / at that 
time. Note again that we do not include transitions along e^^ or /^^ = e' , so that by 
definition one of the tw^o numbers Me and Mf must be 1. 

With Q defined w^e can start the calculation. Recall that e^ is the first edge crossed 
by the walk. Suppose x is some edge of G and we want to estimate VSlx/V^ei- We fix x 



for the reminder of the proof. Let T = Tx denote the set of possible values for the path 
of domination i.e. the set of simple paths or loops whose first edge is one of the edges 
coming out of vq and whose last edge is either x or x^^ (deperiding on which is crossed 
first). 

Split the probability space according to the value of the path of domination: 



E 



Wv 



w, 



ei 



7er 



t)^(-^> 



Naturally, under ^y, ei must be the first edge of 7. We remind the reader that we as- 
sume a priori that our process is recurrent. This has two implications: first, x will a.s. be 
visited eventually, and so the path of domination is well-defined. Second, the weights 
are unique, so W^/V^ei is a well-defined variable on our probability space. 

Given the weights W, let R be the actual ratios along the path of domination, so for 
eG7\{ei}, 

On the event ^^ for 7 fixed, w^e may telescope W^/We^ via 



^ = n m 



n fi n Qw. 



An application of the Cauchy-Schwarz inequality then gives 



E 



S'><-^' 



<E 



n m 

eG7Vi ^^*^ ' 



2s 



1{%} 



E 



n Q(^ri{%} 

.cG7\ei 



(2) 



Theorem 2 then essentially boils down to the following two lemmas. 



Lemma 7 For any graph G, any starting vertex vq and any a S (0, 00) such that the LRRW 
on G starting from vq with initial weights a is recurrent, for any edge x G E, any 7 G T^ and 
any s G (0,1), 



E 



n (M ) i{^.} 



<C(s) 



ItI-i 



where C{s) is some constant that depends on s but not on G, Vq, a or anything else. 

Lemma 8 For any graph G with degrees bounded by K, any starting vertex vq and any a G 
(0,00) such that LRRW on G starting from vq with initial weights a is recurrent, for any edge 
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X G G, any 7 G F^c and any s G (0, 1/2), 



E 



n Qw^i{^7} 

eG7\ei 



< 



C(s,i<C)fl 



|7|-1 



where C{s, K) is a constant depending only on s, K. 



In these two lemmas, it is not difficult to make C{s) and C{s,K) explicit, following the 
proof gives C{s) = O ( j^ ) and C{s,K) = 0{ j^ + s ) • However, there seems to be little 
reason to be interested in the s-dependency. We w^ill apply the lemmas with some fixed 
s, say 1/4. We do not have particularly strong feelings about the X-dependency either. 
It is worth noting that Lemma 7 is proved by using the random environment point of 
view on the LRRW, while Lemma 8 is proved by considering the reinforcements. Thus 
both views are central to our proof. We note that Theorem 2 can be extended to s < 1/3 
by simply using Holder's inequality instead of Cauchy-Schwartz. Relaxing the limit on 
s in Lemma 7 may allow any s < 1/2, but that is the limit of our approach, since the 1/2 
in Lemma 8 is best possible. 

Proof of Lemma 7. For this lemma the RWRE point of view is used, as it must, since the 
weights W appear in the statement, via R. Our first step is to throw aw^ay the event 
1(^7 } i.e. to write 



E 



n 



Rje) 



1{&,} 



<E 



n 

e£7\ei 






(3) 



where the terms on the right-hand side are as follows: R^(e) is the ratio betw^een Wg and 
Wf where / is the predecessor of e in 7; and Q-Y{e) is defined by following the process 
until both e and / are crossed at least once from e^ and then define Me, Mf and Q 
according to these crossings. Clearly, under ^^ both definitions are the same so (3) is 
justified. 

This step seems rather wasteful, as heuristically one can expect to lose a factor of K'^' 
from simply ignoring a condition like &^. But because our eventual result (Theorem 2) 
has a C{s, Ky^' term, this will not matter. Since 7 is fixed, from this point until the end 
of the proof of the lemma we will denote R = Rj and Q = Q-y. 

At this point, and until the end of the lemma, we fix one realization of the w^eights 
W and condition on it being chosen. This conditioning makes the R{e) just numbers, 
while the Q{e) become independent. Indeed, given W, the random walk in the random 
environment can be constructed by associating with each vertex v a sequence Zf^ of i.i.d. 
edges incident to v with law Wg/ Y^^^v ^e- If the walk is recurrent, then v is reached 
infinitely often, and the entire sequence is used in the construction of the walk. If we 
fix an edge e and let / = e'^^, then Me (resp. Mj) is the number of appearances of e 



11 



(resp. /) in the sequence {Z^} up to the first time that e and / have both appeared. As 
a consequence, since the sequences for different vertices are independent, we get that 
conditioned on the environment W, the estimates Q(e) for Wg/VSIf for pairs incident to 
different vertices are all independent. 

Thus to prove our lemma it suffices to show that for any two edges e, f leaving some 
vertex v, with Me and Mj defined as above we have 



E 



Y^M£' 

Wf Me 



W 



<C(s) 



We now show that this holds uniformly in the environment. 

First, observe that entries other than e, f in the sequence of i.i.d. edges at v have no 
effect on the law of Me and Mf, so w^e may assume w.l.o.g. that e and / are the only 
edges coming out of e^ . Denote the probability of e by p and oi fhy q = 1 — p (for some 
p G (0, 1)). Now we have for n > 1, that the probability that e appears n times before / 
is p"q, and similarly for / before e with the roles of p and q reversed. Thus 



E 



W^Mf^' 
Wf M, 



W 



p 
(1. 

W,/Wf 



k>l k>l 



f first 



e first 



It is a straightfor^vard calculation to see that this is bounded for \s\ < 1. The first term 
is the s-moment of a Geom(p) random variable, which is of order p^^q, and with the 
pre-factor comes to q^^^. The second term is the (—s) -moment of a Geora{q) random 
variable, which is of order pq^, and with the pre-factor gives p*+^. Thus 



E 



Wf Me 



w 



,1-s 



A+s 



<C{s) 



(recall that we assumed s G (0, 1)). This finishes the lemma. 
Now we move on to the proof of Lemma 8. 



n 



Proof of Lemma 8. Fix a path 7. We shall construct a coupling of the LRRW together with 
a collection of i.i.d. random variables Q(e), associated with the edges of 7 (except ei) 
such that on the event &j, for every edge e G 7 \ ei we have Q(e) < Q(e), and such that 
for s < 1/2, 

]EQ{eY<C{s,K)a. 
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The claim would follows immediately, because 

lEYlQieYl{!^^} < lEYlQieyi{!^^} < EYlQ{ey = nEQl^r 
<YlC{s,K)a = (C(s,K)fl)'^'"\ 

The remarks in the previous proof about "waste" are just as applicable here, since we 
also, in the second inequality, threw away the event ^j. Note that we cannot start by 
eliminating the restriction, since we only prove Q < Q on the event &j. 

Let us first describe the random variables Q. Estimating their moments is then a 
straightforward exercise. Next we will construct the coupling, and finally we shall verify 
that Q(e) < Q(e) . For an edge e = {e^, e+ ) of 7, we construct two sequences of Bernoulli 
random variables (both implicitly depending on e). For / > 0, consider Bernoulli random 
variables 

v. ^ Bern ( -^-—] , Y' = Bern f — -1±^ 

^ \j + l+2aj' ^ \2] + l + Ka 

where Bern(p) is a random variable that takes the value 1 with probability p and w^ith 
probability 1 — p. All Y and Y' variables are independent of each other and of those 
associated with other edges in 7. In the context of the event ^^, we think of Yq as the 
event that decides which of e and / is crossed first. For / > 1, think about Yy as telling 
us whether on the /'^ visit to e^ we depart along e and Y' telling us whether we depart 
along / = e'^^. This leads to the definition 



Q = Me/Mf, 



w^here 



Me = min{/ > 1 : Y| = 1} and Mf = 1, if Yq = 0, 

Mf = min{/ >l:Yj = l} and Me = 1, if Yq = 1. 

Moment estimation. To estimate EQ we note 

^ '^ ' '^ ' 1 -i- Vn In -i- 1 J- Vn i- i- \ 



1 + Ka 2n + 1 + Xfl y V 2; + 1 + Ka 
The first two terms we estimate by 

{K-l)a 1 + a Ka 1 + Ka _ Ka 

1 + Ka In + l + Ka ~ 1 + Ka In ~ In 



13 



while for the product we note that for any a > 0, 

1+fl ^ . f 1 1 
> mm 



Ij + l + Ka- \2i + lK 

Putting these together we get 

P(y^ = CM, = n)< ^ 'riexp (-^ + 0(r')) = ^exp (-ilog(n) + 0(1)) 
Thus for s < 1/2, 



E 



Q(e)=l{y^ = 0}] < E "'^'(^0 = 0,M, = n) 

< Yj C{K)an'-^'^ < C{s,K)a. 



n>l 



(This is the main place where the assumption s < 1/2 is used.) 

we write 

p(y^ = 1,M(/) = n) < P(y„ = !)<-, 



For the case Yq = \ we write 



and so 



E 



Q(e)^l{y^ = 1}] = X: «n-'-^ < C(s)fl. 



n>l 



Together we find EQ(e)^ < C{s,K)a as claimed. 

The coupling. Here we use the linear reinforcement point of view of the walk. We 
consider the Bernoulli variables as already given, and construct the LRRW as a function 
of them (and some additional randomness). Suppose we have already constructed the 
first t steps of the LRRW, are at some vertex v, and need to select the next edge of the 
walk. There are several cases to consider. 

Case 0. y ^ 7: In this case we choose the next edge according to the reinforced edge 
weights, independently of all the Y and Y' variables. 

Case 1. y G 7 and the LRRW so far is not consistent with ^^: We may safely disregard 
the y variables, as nothing is claimed in this case. This case occurs if for some 
edge e G 7 is traversed only after its inverse is, or if the first arrival to e^ was not 
through the edge preceding e in 7. 
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Case 2. u G 7, and Q is already determined: li v = e^ for e G 7, and both e and 
(e')^^ have both already been traversed, then Q(e) is determined by the path so 
far. Again, we disregard the Y variables. 

For the remaining cases, we may assume that &y is consistent with the LRRW path so 
far, and that v = e^ for e G 7 \ ei. As before, let / = {e')^^. If we are not in Case 2, then 
one of e, f has not yet been traversed. 

Case 3. This is the first arrival to v. In this case the weights of all edges incident to v are 
still a, except for / which has weight 1 + a. Thus the probability of exiting along / 

is 

1+fl 1+fl 



1 + d-„a ~ 1 + Xfl 

where as usual dj, is the degree of the vertex v. Thus we can couple the LRRW step 
so that if Yq = 1, then the walk exits along / (and occasionally also if Yg = 0). 

Case 4. Later visits to v when Yq = 0. Suppose this is the n* visit to v. The w^eight of / 
is at least 1 + fl, since we first entered v through /^^. The total weight of edges at 
u is 2n — 1 + dytt. Thus the probability of the LRRW exiting through / is 

Nt{f)+a ^ 1 + fl 



2n - 1 + d-^a ~ 2n-l + Ka 

where t is the time of this visit to v, and Nf (/) is, as usual, the number of crossings 
of the edge / up to time t. Thus we can couple the LRRW step so that if Y^_^ = 1, 
then the walk exits along / (and occasionally also iiY^_^ = 0). 

Case 5. Later visits to v with Yq = 1. Here, / was traversed on the first visit to v. Since 
we are not in Case 2, e has not yet been traversed. Since we are not in Case 1, 
neither has e^^, and so e still has weight a. In this case, we first decide with ap- 
propriate probabilities, and independent of the Y, whether to use one of {e,/}, or 
another edge from v. If we decide to use another edge, we ignore the Y variables. 
If we decide to use one of {e, /}, and this is the n* time this occurs, then Nt (/) > n 
(since we had only chosen / so far in these cases, and also used /^^ at least once). 
Thus the probability that we select e from {e,f} is 

a a 

< 



Nt{f)+2a ~ n + 2fl' 
Thus we can couple the LRRW step so that if Y„_i = 0, then we select /. 

Domination of Q(e). We now check that on ^^ we have Q(e) < Q(e). Assume &y 
occurs. When Yq = the coupling considers only the Y' variables, one at each visit to 
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V until / is used. If n is minimal s.t. Y^ = 1 then / is used on the (n + 1)^' visit to v or 
earlier. Thus 

Q{e) <M,<n = Q{e). 

Note that if Yg = it is possible that the walk uses / before e, and then Q{e) < 1. 

If Yq = 1 then the coupling considers only the Ys, one at each time that either e or f 
is traversed (and not at every visit to v). If n is the minimal n > 1 s.t. Y„ = 1, then / is 
used at least n times before e. Thus 



M 



f 



and in both cases Q(e) < Q(e), completing the proof. 



D 



Let us remark briefly on how this argument changes if the weights a are not equal, 
or possibly random. In order to get the domination Q(e) < Q(e) the variables Yj and Y- 
are defined differently. Setting a„ = J^^^^ flg, we use 



Yj = Bern 



i + l + ae + af 



^i 



Bern 



1 + fl 



/ 



2/ + 1 + fl„ 



This changes the moment estimation, and instead of Ca we get C(fli, + fl^^''). If the a^s are 
all sufficiently small there is no further difficulty. If the a's are random, this introduces 
dependencies between edges, and some higher moments must be controlled. 

Corollary Theorem 2 holds on graphs where the LRRW is recurrent. 
Proof. This is just an aggregation of the results of this section: 



^m-L^Mn..^ 



by (2) 
By Lemmas 7 and 8 



<EE 



n (^ 



2s 



\{%} 



E 



n Q(e)^^l{^7} 

,e£7\ei 



<X] C(2s)lT'l-i 2 (c(2s,K)fl) 



ItI-i 



= E (Cov/^) 



|7|-1 



for Co = a/C(2s)C(2s, X). Now, the number of paths of length i is at most K^ and the 

1 

2 



shortest path to e has length dist(e, uq) + 1- H we take aq such that XCoi/flo = \ then for 
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a < uq longer paths give at most a factor of 2 and so 

3 Recurrence on Bounded Degree Graphs for Small Values of a 

We are now ready to prove Theorem 1. As noted above, the main idea is to approximate 
the LRRW on G by LRRW on finite balls in G. Let R > and let X(^) be the LRRW 
in the finite volume ball Br{vo). By Theorem 4, for each R the full distribution of X'^' 
is given by a mixture of random conductance models with edge weights denoted by 
( Wg )eeBj^(uo)' ^ri<i mixing measure denoted by //^^' . Recall O = R^ is the configuration 
space of edge weights on the entire graph G. For fixed a, the measures }i^^> form a 
sequence of measures on O, equipped with the product Borel cr-algebra. 

Proof of Theorem 1. Theorem 2 (or 4 if you prefer) implies that the measures }i^'^' are tight, 
and so have subsequential limits. Let }i be one such subsequence limit. Clearly the law^ 
of the random walk in environment W is continuous in the weights. However, the first 
R steps of the LRRW on Br{vo) have the same law as the first R steps of the LRRW on 
G. Thus the LRRW on G is the mixture of random walks on weighted graphs with the 
mixing measure being /^. 

Fix some s < 1/4 for the remainder of the proof. For any edge e G Br(uo) we have by 
Markov's inequality that 



where C = C{s,K) comes from Theorem 2, which we already proved for recurrent 
graphs. Taking Q = (^2K)^ '^'^^^^'^''"°\ and since the number of edges at distance £ from 
Vq is at most K^^^, we find that the probability of having an edge at distance £ with 
We > (2i<:)"^isatmost 

K^+\2Ky^2K{C^aY = 2K^ (I'K^+'C^a 



Let flo be such that 2^K^^^C^/ao = ^. Then for a < uq this last probability is at most 
K^2^^^. Crucially, this bound holds uniformly for all }i^^', and hence also for ji. By 
Borel-Cantelli, it follows that j^-a.s., for all but a finite number of edges we have W^ < 
(2X)^'^'^*('^'"'') On this event the random network (G, W) has finite total edge weight 
W = ^g Wg, and therefore the random walk on the network is positive recurrent, w^ith 
stationary measure ti{v) = Wi,/2W. D 



17 



3.1 Trapping and Return Times on Graphs 

With the main recurrence results. Theorems 1 and 2 proved, we wish to make a remark 
about the notions of "localization" and "exponential decay". We would like to point 
out that one should be careful when translating them from disordered quantum systems 
into the language of hyperbolic non-linear c^-models and the LRRW. To illustrate this 
point, let G be a graph with degrees bounded by K, and consider the behavior of the tail 
of the return time to the initial vertex Vq. Let 

T = min{i > : Xt = Vq}. 

Proposition 9 Let K be fixed. Suppose that G is a connected graph with degree bound K and 
some edge not incident to vq. Then there is a constant C > depending on K but not on a so that 



P(t>M) >c{a,K)M 



-{K-l)a 



Thus despite the exponential decay of the weights, the return time has at most {K — 
l)fl — 2 finite moments. 

Proof. There is some edge e at distance 1 from vq. Consider the event E^ that the LRRW 
moves towards e and then crosses e back and forth 2M times. On this event, t > 2M. 
The probability of this event is at least 



P(£ 



M, 



1 
> — 
^ K 



1 
K 



^ 2j + a 

0<;<M 2; + 1 + ^« 



n 

0</<M 



{K-l)a 
2; + l 



>c{a,K)M-^^-^^''. 



y^ Ij + l + a 

0<;<M 2/ + 1 + ^« 



o 



n 



One might claim that we only showed this for the first return time. But in fact, mov- 
ing to the RWRE point of view shows that this is a phenomenon that can occur at any 
time. Indeed, ifP(T > M) > £ then this means that with probability > ^e, the envi- 
ronment W satisfies that P(t > M | W) > ^e. But of course, once one conditions on the 
environment, one has stationarity, so the probability that the /c* excursion length (call it 
Tk) is bigger than M is also > \e. Returning to unconditioned results give that 

P(t^ > M) > ^£2 = c{a,K)M-^'^^-^^'' VJt 

This is quite crude, of course, but shows that the phenomenon of polynomial decay of 
the return times is preserved for all times. 
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3.2 Reminiscences 

Let us remarks on how we got the proof of these results. We started with a very simple 
heuristic picture, explained to us by Tom Spencer: when the initial weights are very 
small, the process gets stuck on the first edge for very long. A simple calculation shows 
that this does not hold forever but that at some point it breaks to a new edge, it then 
runs over these two edges, roughly preserving the weight ratio, until breaking into a 
third edge, roughly at uniform and so on. 

Thus our initial approach to the problem was to try and show using only the LRRW 
picture that weights stabilize. This is really the point about the linearity — only lin- 
ear reinforcement causes the ratios of weights of different edges to converge to some 
value. One direction that failed was to show that it is really independent of the rest 
of the graph. We tried to prove that "whatever the rest of the graph does, the ratio of 
the weights of two edges is unlikely to move far, when both have already been visited 
enough times". So we emulated the rest of the graph by an adversary (which, essen- 
tially, decides through which edge to return to the common vertex), and tried to prove 
that any adversary cannot change the weight ratio. This, however, is not true. A sim- 
ple calculation that shows that an adversary which simply always returns the walker 
through edge e will beat any initial advantage the other edge, / might have had. 

Trying to somehow move a^vay from a general adversary, we had the idea that the 
RWRE picture allows to restrict the abilities of the adversary, which finally led to the 
proof above (w^hich has no adversary, of course). We still think it would be interesting 
to construct a pure LRRW proof, as it might be relevant to phase transitions for other 
self-interacting random walks, which are currently wide open. 

4 Transience on Non-amenable Graphs 

In this section we prove Theorem 3. The proof is ideologically similar to that of The- 
orem 2, and in particular we prove the main lemmas under the assumption that the 
process is recurrent — there it seemed natural, here it might have seemed strange, but 
after reading § 3 maybe not so much: in the end we will apply these lemmas for finite 
subgraphs of our graph. We will then use compactness, i.e. Theorem 4, to find a subse- 
quential limit on G which also describes the law of LRRW on G. 

Let us therefore begin with these lemmas, which lead to a kind of "local stability" of 
the environment W when the initial weights a are uniformly large. 

As in the case of small a, our stability argument has two parts. We use the dynamic 
description to argue that the walk typically uses all edges leaving a vertex roughly the 
same number of times (assuming it enters the vertex sufficiently many times). We then 
use the random conductance view point to argue that the random weights are typically 
close to each other. Finally, we use a percolation argument based on the geometry of the 
graph to deduce a.s. transience. 
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4.1 Local Stability if LRRW is Recurrent 

Let us assume throughout this subsection that the graph G and the weights a are given 
and that they satisfy that LRRW is recurrent. 

Let L be some parameter which we will fix later. The main difference between the 
proofs here and in § 2 is that here we will examine the process until it leaves a given 
vertex L times through each edge rather than once. Let therefore t = t{L,v) denote 
the number of visits to v until the LRRW has exited v at least L times along each edge 
e 3 V. Note that since we assume that the LRRW is recurrent, these stopping times are 
a.s. finite. Let M{e) = M{L,v,e) denote the total number of outgoing crossings from 
V along e up to and including the t'^ exit. Given the environment W, call a vertex v 
£-faithful if M(e) /t is close to its asymptotic value of Wg/W^, i.e. 

Lemma 10 Let the degree bound K be fixed. Then for any £,S > there exists L = L{K, e, S) 
so that 

F^{v is not e-faithful) < 5. 

Moreover, these events are independent under P^. 

A crucial aspect of this lemma is that L does not depend on the environment W. A 
consequence is that for this L, the e-faithful vertices stochastically dominate \ — 5 site 
percolation on G for any P^, and hence also for the mixture P. 

Proof. Given W, the exits from v are a sequence of i.i.d. random variables taking d-,, < K 
different values with some probabilities p{e) = W^/W^. Let A„(e) be the location of 
the n* appearance of e in this sequence, so that A„ is the cumulative sum of Geom(p) 
random variables. By the strong law of large numbers, there is some L so that w^ith 
arbitrarily high probability (say, 1 — 3), we have 

pAn 



< e for all n > L. (4) 

We now claim that L can be chosen uniformly in p (for e and S fixed). This can be 
proved either by a replacing the law of large numbers by a quantitative analog, say a 
second moment estimate, or by using continuity in p: Indeed, the only possible discon- 
tinuity is at zero, and pA„ converges as p — )■ to a sum of i.i.d. exponentials, and so 
with this interpretation the same claim holds also for p = 0. Since the probability of a 
large deviation at some large time for a sum of i.i.d. random variables is upper semi- 
continuous in the distribution of the steps, and since upper semi-continuous functions 
on [0, 1] are bounded from above, we get the necessary uniform bound. 
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We now move back from the language of A„ to counting exits at specific time t. De- 
note Mt{e) the number of exits from v through e in the first t visits to v. lit ^ [An, A„+i) 
then Mt = n and hence, for t > Ai and with probability > 1 — 3, 



Mt{e) ^ 



pt \pAn+l' pAn 



(4) 

c[l-eM + e'] 



(where e' is some function of e with e' — > when e — > 0). 

Since t > Ai{e) for all edges e coming out of v, we get M{e)/pT G [1 — e', 1 + e'] for 
all e, •with probability > 1 — KS. Replacing e' with e and KS with 3 proves the lemma. D 

The second step is to use the dynamical description of the LRRW to show that if a is 
large then the M{L,v,e) are likely to be roughly equal. Let S{L,v) = maxego M{L,v,e) 
be the number of exits from v along the most commonly used edge. By the definition, 
M{L, v,e) > L for all e, v. We will therefore call a vertex e-balanced if S{L, v) < {1 + e)L. 

Lemma 11 For any K, e, 3 there is an Lq — Lo{K, e, 3) such that for any L > Lq there is some 
flo so that for any a > ao such that LRRW on G with initial weight a is recurrent we have 

W{v is not e-balanced) < 3. 

Moreover, for such a, the e-balanced vertices stochastically dominate 1 — 3 site percolation. 

Proof. We prove directly the stochastic domination. To this end, we prove that any v 
is likely to be e-balanced even if we condition on arbitrary events occurring at other 
vertices. At the /* visit to v, the probability of exiting via any edge e is at least a/ {d^a + 
2z — 1). Throughout the first T visits to v, this is at least \/dv — TT / a. Since this bound 
is uniform in the trajectory of the walk, we find that the number of exits along an edge e 
stochastically dominates a Bin(T, j — ^) random variable, even if we condition on any 
event outside of v. 

We take T = {l-\- e)d^L. If a is large enough {CK^L/e suffices) then the binomial has 
expectation at least L — ^eL. Since it has variance at most T = 0{L), if L is sufficiently 
large then the binomial is very likely to be at least L. In summary, given 3 and e we can 
find some large L so that for any large enough a, with probability at least 1 — 3 there are 
at least L exits along an any edge e up to time T. This occurs for all edges e 3 v •with 
probability at least 1 — K3. 

Finally notice that if all edges are exited at least L times, then this accounts for dyL 
exits, and only T — dyL exits remain unaccounted for. Even if they all happen at the same 
edge, that edge would still have only L + edyL exits, hence S < L + edyL. Therefore v is 
ed^j-balanced with probability at least 1 — K3. Redefining e and 3 gives the lemma. D 

Call a vertex e-good (or just good) if it is both £-faithful and e-balanced. Otherwise, 
call the vertex bad. Note that if v is good, then weights of edges leaving v differ by a 
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factor of at most j-^ < (1 - e)"^- 

Corollary 12 For any K, e, 3, for any large enough L,for any large enough a, the set ofe-good 
vertices stochastically dominates the intersection of two Bernoulli site percolation configurations 
with parameter 1 — 3. 

Unfortunately, these two percolation processes are not independent, so we cannot 
merge them into one (1 — 3)'^ percolation. 

4.2 Application to Infinite Graphs 

Let G„ denote the ball of radius n in G = {V,E) with initial vertex vq. Let }i„ denote 
the mixing measure guaranteed by the first half of Theorem 4. Further let ^„ be the 
sequence of coupling measures guaranteed by Corollary 12. 

According to the second half of Theorem 4 the measures f/„ are tight. Quite obviously 
the remaining marginals of ^„ are also tight. Therefore ^„ is tight. Thus we can always 
pass to a subsequential limit ^ so that the first marginal is a mixing measure for }i and 
the conclusion of Corollary 12 holds. We record this in a proposition: 

Proposition 13 For any K, e, 3, for any large enough L, for any large enough a the following 
holds. For any weak limit }i of finite volume mixing measures there is a coupling so that the set 
of e-good vertices (with respect to ji) stochastically dominates the intersection of two Bernoulli 
site percolation configurations with parameter 1 — 3. 

We now use a Peierls' argument to deduce transience for large enough a. We shall 
use two results concerning non-amenable graphs. The standard literature uses edge 
boundaries so let us give the necessary definitions: we define the edge boundary of a set 

by 9e(A) = {{x,y) E E : x e A,y ^ A}, VolA = Lvga^v and 



AcG,VolA<oo Vol A 

Clearly 

. = mf^>mf^^^-^ = ^.E. 

and similarly l < K^l^. 

The first result that we will use is Cheeger's inequality: 

Theorem 14 IfG is non-amenable then the random walk on G has return probabilities pn (0, 0) < 
Ce~^", where j6 > depends only on the Cheeger constant i£(G). 

Cheeger proved this for manifolds. See e.g. [3] for a proof in the case of graphs. 
A second result we use is due to Virag [20, Proposition 3.3]. Recall that the anchored 
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expansion constant is defined as 

\dEA\ 



a(G) = lim inf 



)i^oo|^l>„ Vol(A) 

where the infimum ranges over connected sets containing a fixed vertex v. It is easy to 
see that a is independent of the choice of v. 

Proposition 15 ([20]) Every infinite graph G with a-anchored expansion contains for any 
e > an infinite subgraph with edge Cheeger constant at least a. — a. 

Proof of Theorem 3. Let £ be some small number to be determined later, and let G^ denote 
the set of £-good vertices (and also the induced subgraph). For any set F with boundary 
dF, the £-bad vertices in dF are a union of two Bernoulli percolations, each of which 
is exponentially unlikely to have size greater than than ||9F|, provided that 3 < ^. 
Specifically, 



F(\GendF\ < -\dF\\ <e-^|9^l < g-^'l^l. 



where c = c{S) tends to oo as i5 — ?■ 0, and i > is the Cheeger constant of the graph. 
The number of connected sets F of size n containing the fixed origin o, is at most 2^" 
(this is true in any graph with degree bound K — the maximum is easily seen to happen 
on a iC-regular tree, on which the set can be identified by its boundary which is just Kn 
choices of whether to go up or down). Thus if 3 is small, so that c{3) > X(log2) //, then 
a.s. only finitely many such sets F have bad vertices for half their boundary. 

Taking such a 3, we find that G^ contains an infinite cluster 'if which has "vertex 
anchored expansion" at least /(G) /2. Moving to edge anchored expansion loses a K^ 
and w^e get a > l/2K^. By proposition 15 we find that 'tf contains a subgraph H -with 
<E(H)>i(G)/3K2. 

By Theorem 14, the simple random walk on H has exponentially decaying return 
probabilities. However, since all vertices of H are £-good, edges incident on vertices of 
H have weights within (1 — £)^^ of equal. Thus the random walk on the weighted graph 
(H, W) is close to the simple random walk on H. Specifically, letting p^ denote the heat 
kernel for the W-weighted random walk restricted to H, then we find 

p^(0,0) < (1 - £)-2"p„(0,0) < C(l - e)-^"e-^", 

where /S is some constant depending only on G. In particular for £ small enough, these 
return probabilities are summable and the walk is transient. 

Finally by Rayleigh monotonicity (see e.g. [4]), (G, W) is transient as well, complet- 
ing the proof. D 

Since there are several parameters that need to be set in the proof, let us summarize 
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the final order in which they must be chosen. The graph G determines K and '(G) . Then 
S is chosen to get large enough anchored expansion. This determines via Theorem 14 
the value of j6 for the subgraph H, which determines how small e needs to be. Finally, 
given e and S we take L large enough to satisfy Lemmas 10 and 11 and the minimal a is 
determined from Lemma 11. 

5 Vertex Reinforced Jump Process 

In this section we apply our basic methods to the VRJP models mentioned in the intro- 
duction. This demonstrates the flexibility of the approach and gives a second proof of 
recurrence of LRRW based on the embedding of LRRW in VRJP with initial rates J i.i.d. 
with marginal distribution r(fl, 1). 



5.1 Times they are a changin' 

As explained in the introduction, the VRJP has a dynamic and an RWRE descriptions, 
related by a time change. Let us give the details. The dynamic version we will denote 
by Yf, and its local time at a vertex x hy Lx{t), so that t = J^x^x{i)- Recall that Yt moves 
from X to y with rate /x,i/(l + hj{t))- 

The RWRE picture is defined in terms of a positive function W = {W-o)i,^v on the 
vertices. Given W we will denote by Z, = Z^ the random walk in continuous time that 
jumps from x to y with rate \jxy^y/^x- We will denote the local time of Z by Mx{s), 
and again s = J^^ M.r(s)- When some random choice of W is clear from the context w^e 
will denote by Zg the mixed process. 

The time change relating s and t is then given by the relation that M; 
equivalently that Lj = ^1 + Mx — 1. Summing over all vertices gives a relation between 
s and t. Since the local times are only increasing at the presently occupied vertex, this 
gives the equivalent relations ds = 2(1 + LYXi))dt and dt = ds/l^J\ + MzX^)- 

Theorem 16 ([19]) On any finite graph there exists a random environment W so that (Zg) is 
the time change of {Yt) given above. 

For the convenience of the reader, here is a little table comparing our notation with 
those of [19]: 



L\ + 2Lj, or 



Here 


/ 


W 


L 


[19] 


W 


,u 


L-1 



In fact. Sabot and Tarres also give an explicit formula for the law of the environment W. 
However, as with the LRRW, we do not require the formula for this law, but only that it 
exists. 
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One more way to think of all this is that the process has two clocks measuring the 
occupation time at each vertex. From the VRJP viewpoint, the process jumps from i to 
/ at rate /y(l + Lj{t))dt (i.e. with respect to the L's). From the RWRE viewpoint, the 
process jumps at rate ^JijWj/Wjds. Theorem 16 states that the above relation between ds 
and dt, these two descriptions give the same law for the trajectories. 

It is interesting to also describe the process in terms of both the local times and re- 
inforcement. Using the time of the L^s, and given the environment, we find that the 
process jumps from i to ; at rate /^(Wj/ W,) (1 + Li)dt. Note that here it is L, and not Lj 
that controls the jump rates. Similarly, we can get a reinforcement description for Z: the 
process jumps from i to / at rate /yV(l + Mj)/{1 + Mi)ds. This gives a description of 
Z with no random environment, but using reinforcement. Nevertheless, it will be most 
convenient to use Z for the RWRE side of the proof and Y for the dynamical part. 

A final observation is that conditional on J and W, the discretization of Zg (i.e. the 
sequence of vertices visited by the process) has precisely the same law as the random 
walk on G with conductances given by Cy = JijWiWj. 

5.2 Guessing the environment 

As with the LRRW, the main idea is to extract from the processes some estimate for the 
environment, and show that it is reasonably close to the actual environment on the one 
hand, and behaves well on the other. 

For neighboring vertices /,/', let Sy be the first time at which Z jumps from i to /, and 
let Tij = Mi{Sij) be the local time for Z at / up to that time. Given the environment W, 

we have that Xy = Exp (^/y Wy/W,) . Thus we can use Qy := ^T,-,/Ty as an estimator for 

Ry = Wj/W,. 

For a vertex v, we consider a random simple path 7 from vq to v, where each vertex 
X is preceded by the vertex from which x is entered for the first time. This 7 is just the 
backward loop erasure of the process up to the hitting time of v. As for the LRRW, we 
need two estimates. First an analogue of Lemma 7: 

Lemma 17 For any simple path 7 in a finite graph G, any environment W and any s <lwe 
have 

R,V ( ns \h\ 



^"n(S) 



ee-y \>ce/ -smns 

Second, we need an analogue of Lemma 8. Recall from § 2 that &y denotes the event 
that the backward loop erasure from u is a given path 7. We use the same notation here. 

Lemma 18 There exists some C > such that for any < s < 1/4 the following holds. For 
any finite graph G, any conductances J, and simple path 7 starting at vq, 

^Y\Ql^i{&,}<c{st\Y\]f. 

25 



Proof of Lemma 17. Given the environment, the time Z spends at / before jumping to / 

is Exp {yijWj/Wi) which may be written as (2W,//,yW^)Xy, where X,y = Exp(l). Cru- 
cially, given the environment the variables X,y are all independent. For an edge e = [i,]) 
this gives Re/Qe = \f^4]T^i- Therefore 



^"Sd) 



'^^ ( , N / n\IT'I {US 

= ri+sri-s 

by the reflection identity for the F function. D 

Lemma 19 Suppose < s < 1/4 
U, let V = Exp(/(1 + U)). Then 



Lemma 19 Suppose < s < 1/4, and let J > be fixed. Let U = Exp(/) and conditioned on 

D 



\2U + WJ ~ 1-As' ' 

where C is some universal constant. 

Proof. We can reparametrize U = j and V = ^^, where X and Y are independent 
Exp(l) random variables. In term of X, Y we have 

2V + V^ _ 2}^Y{J + X)+ J^Y^ 2fY }^Y^ 
2(i+LJ2 ~ X(/ + X)2(2/ + X) ^ ~W ^ IF' 

We now calculate, using [a + by < a^ + ¥ for < s < 1, 



E (^ + 1^)' < /'^ (e(2YX-3)^ + E(y2x 



2v-4\s 



j2s ( 2'EY'EX-3s ^ ]Ey2s]Ex-4s ) by independence 



C C 

< J^'' since EX" < 



4s ~ 1 + a 

(the inequality EX" < C/(l + fl) holds for \a\ < 1, the relevant range here). D 

Proof of Lemma 18. Consider an edge e = {i,j) G 7, and let T,y be the first time t at which 
Yf jumps from / to j (and similarly define Tjj). On the event &j the process does not 
visit i before the first jump from i to /. Thus Lj{t) = for all t < Tjj. Hence the jump 
z — ?► ; occurs at rate /y whenever Y is at i, and so U := Li{Tjj) has law Exp(/,y). More 
precisely, the statement about the jump rate implies that we can couple the process Y 
with an Exp(/,,) random variable, so that on the event ^-y it equals U. 

Let V = Lj{Tji) be the time spent at ; before the first jump back to /. Since L,(i) > U 
from the time we first enter /, the rate of such jumps is always at least /y (1 + U), we find 
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that Y is stochastically dominated by a Exp(/y(l + \X)) random variable. 

All statements above concerning rates of jumps along the edge e hold (on the event 
Sl^), uniformly in anything that the process does anywhere else. Thus it is possible to 
construct such exponential random variables for every edge e G 7, independent of all 
other edges, so that the M equals the first and V is dominated by the second. The claim 
then follows by Lemma 19, since Ty = 2\1 + LZ^ and Ty, = 2y + V^ by their definitions 
and the time change formulae. D 

5.3 Exponential decay and Theorem 6 

Let Gr denote the ball of radius K around vq. Denote by }v-^' the VRJP measure on Gj? 
and the corresponding expectation by E*-^). In the proof, Ej denote expectation with 
respect to J. Theorem 6 follows from the following: 

Theorem 20 There is a universal constant c > such that the following holds. Let G be a 
fixed graph with degree bound K. Let J = {}e)eGE be a family of independent initial rates with 

lEjy^ < cK-"^. 

Then (a.s. with respect to ]) the measures ji^^^ are a tight family and converge to a limit p. on 
]R^ so that the VRJP is a time change of the process Zs in the environment given by p. The limit 
process is positive recurrent, and the stationary measure decays exponentially. 

The moment condition on / is trivially satisfied in the case that all J/s are bounded by 
some sufficiently small }q. The particular condition comes from specializing to s = 1/5, 
and can be easily changed by taking other values of s or by using Holder 's inequality in 
place of Cauchy-Schwartz in the proof below. The dependence on K may be similarly 
improved. 

Proof. Combining Lemmas 17 and 18 with Cauchy-Schwartz, for any v, any radius R > 

dist(yo/ ^) and any path j : Vq —> v in Gr we have (recall W-og = 1): 



<ci^'n/; 






where Ci depends only on s. Let the c from the statement of the theorem be 1/Ci(^). 
Then with s = 1/5 w^e get 

EjE^Wy^lji^^} < k-4|ti. 

Since the number of paths of length n is at most K" and no path to v is shorter than 
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dist(yoA ^) this implies 

Since this bound is uniform in R, the Borel-Cantelli Lemma, applied with respect to J, 
implies that the measures fi^'^' are tight and that they have subsequential limits J. Let 
}i be any such subsequential limit (we later deduce that }i is unique). It is easy to see 
that the weak convergence of //(^' to f/ implies a convergence of Z^^\ Y'^' and the time 
change between them to Z, Y and the time change between them corresponding to the 
infinite measure (all convergences are along the chosen subsequence). However, from 
the reinforcement viewpoint, Yf has the same law on all Gr until the first time it reaches 
the boundary of the ball. Thus ji yields the VRJP on the infinite graph G. 

As noted above, the discretized Zg is just a random walk on G with conductances 
Cy = JijWiWj. By Markov's inequality P(W„ > K-^^^i^ii^o,^)) < 2X-2dist(j;o,i')_ By Borel- 
Cantelli, it follows that a.s. W-o < x^-^'^^**^"'''^') for all but finitely many v, and therefore 
Ce < j..f(^-(>dist{e,vo) _ However, the number of edges at distance n is at most K" and 
we assumed Je has a finite 1/5 moment so yet another application of the Borel-Cantelli 
Lemma ensures that X^g j^j(^-6dist{vo,v) ^ ^^_ Thus Yle Ce is almost surely finite, and the 
total weight outside Gr decays exponentially. This implies the positive recurrence. 

Finally, since the process is a.s. recurrent, Z visits each vertex infinitely often, and 
the environment can be deduced from the observed jump frequencies along edges, the 
subsequential limit ji is in fact unique. With tightness, this implies convergence of the 

jiR. D 
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